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Abstract 

We consider a repeated Matching Pennies game in which players have 
hmited access to randomness. Playing the (unique) Nash equilibrium in 
this n-stage game requires n random bits. Can there be Nash equilibria 
(or £-Nash equilibria) that use less than n random coins? 

Our main results are as follows 

• We give a full characterization of approximate equilibria, showing 
that, for any 7 G [0, 1], the game has a 7-Nash equilibrium if and 
only if both players have (1 — 7)n random coins. 

• When players are bound to run in polynomial time with bits of 
randomness, approximate Nash equilibria can exist if and only if 
one-way functions exist. 

• It is possible to trade-off randomness for running time. In particular, 
under reasonable assumptions, if we give one player only O(logn) 
random coins but allow him to run in arbitrary polynomial time with 

bits of randomness and we restrict his opponent to run in time 
, for some fixed fc, then we can sustain an e-Nash equilibrium. 

• When the game is played for an infinite amount of rounds with time 
discounted utilities, under reasonable assumptions, we can reduce 
the amount of randomness required to achieve a e-Nash equilibrium 
to n*, where n is the number of random coins necessary to achieve 
an approximate Nash equilibrium in the general case. 



*This research was partially supported by NSF grants CCF-0829754 and DMS-0652521. 
^This work was done while the author was visiting Northwestern University, Department 
of Electrical Engineering and Computer Science. 
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1 Introduction 



In the classical setting of Game Theory, one of the core assumptions is that 
all participating agents are "fully rational" . This amounts not only to the fact 
that an agent must be able to make optimal decisions, given the other players' 
actions, but also to the fact that he must understand how these actions will affect 
the behavior of all other participants. If this is the Nash equilibrium 

can be viewed as a set of strategies in which each agent is simply computing 
his best response given his opponents' actions. However, in real world strategic 
interactions, people often behave in manners that are not fully rational. There 
are many reasons behind non-rational behavior, we focus on two: limitations on 
computation and limitations on randomness. 

Since the work of Herbert Simon [13 , much research has focused on defining 
models that take computational issues into account. In recent years, the idea 
that the full rationality assumption is often unrealistic has been formalized us- 
ing tools and ideas from computational complexity. It is in fact easy to come 
up with settings, as in Fortnow and Santhanam 4 , in which simply comput- 
ing a best response strategy involves solving a computationally hard problem. 
Furthermore there is strong evidence that, in general, the problem of finding 
a Nash equilibrium is computationally difficult for matrix games (Daskalakis, 
Goldberg and Papadimitriou |3|, Chen and Deng [2]). 

Traditionally bounded rationality has focused on two computational re- 
sources: time and space. In this paper we focus on another fundamental re- 
source: randomness. 

It is a basic fact that games in which agents are not allowed to randomize 
might have no Nash equilibrium. In this sense, randomness is essential in game 
theory. We focus on a simple two player zero-sum game that captures this: 
Matching Pennies (Figure [T]). Specifically, we consider the repeated version of 
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Figure 1: The payoff bimatrix for the Matching Pennies game. 

Matching Pennies, played for n rounds. In this game, the unique Nash equilib- 
rium is the one in which, at every round, both players choose one of their two 
strategies uniformly at random. The algorithm that implements this strategy 
requires n random coins, one for each round. The main question we address in 
this paper is: can there be Nash equilibria if the amount of randomness available 
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to both players is less than n? 

First we show that, in general, the game cannot have a Nash equilibrium 
in which both players have only a fraction of n random coins. In particular, 
when we give both players n(l — 7) random coins, we can only achieve a 7-Nash 
equilibrium. This turns out to be tight, in the sense that we can show that any 
game with a 7-Nash equilibrium both players must have at least n(l — 7) coins. 

The proof of this fact, however, relies on the players' ability to implement a 
strategy that runs in exponential time. We then consider games in which the 
players' strategies are polynomially-bounded. Using ideas developed in cryp- 
tography and computational complexity we show that, in this setting, e-Nash 
equilibria that use only coins exist if and only if one-way functions exist. 

We also show that the amount of randomness can be "traded" for time. 
If we allow one of the players to run in arbitrary polynomial time, but use 
only 0(log n) bits, we can still achieve a e-Nash equilibrium if we restrict his 
opponent to run in time n'^' for some fixed k > 0, while giving him random 
bits. 

Finally we consider an infinitely repeated game with time discounted utili- 
ties. In this case, in general, for any discount factor 6 and approximation e, we 
can always achieve a e-Nash equilibrium with only n random coins, if n is large 
enough. When we limit players' strategies to polynomial size circuits, we can 
reduce the amount of randomness to , for any 5 > 0. 

Related Work 

There are many recent approaches to bounded rationality using a computational 
complexity perspective. For instance Halpern and Pass 7, study games in which 
players' strategies are Turing machines. The idea of considering randomness as 
a costly resource in game theory has received only limited attention. Kalyanara- 
man and Umans jllj study zero-sum games, and give both an efficient deter- 
ministic algorithm for finding £-Nash equilibria, as well as a weaker, but more 
general, result in the spirit of our Lemma 13. 1[ giving a randomness-efficient 
adaptive on-line algorithm for playing repeated zero-sum games. Hu [9 also 
considers a similar setting but he is concerned with computability rather than 
complexity. He considers infinitely repeated plays of 2 player zero-sum games 
that have no pure strategy Nash equilibrium, and in which players have a set 
of feasible actions, which represents both the strategies they can play and the 
strategies they can predict. In this setting Hu gives necessary and sufficient 
conditions for the existence of Nash equilibria. Finally Gossner and Tomala [6] , 
give entropy bounds on Bayesian learning in a game theoretic setting, in a more 
general framework then this paper. Their results applied to Matching Pennies 
do not achieve the tight bounds we get in Lemma |3. II 

The rest of the paper is organized as follows. In Section [5] we introduce the 
notation and known results used. Section [3] presents an information theoretic 
impossibility result. Section S] considers players whose strategies are limited to 
polynomial sized Boolean circuit families, while Sections [5] and [6] give exten- 
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sions of the main results to complexity pseudorandom number generators and 
infinitely repeated versions of the game. 



2 Background and Definitions 

2.1 Game Theory Notation 

Throughout the paper we consider a repeated game of Matching Pennies. We 
focus on this game because it captures one of the fundamental aspects of ran- 
domness in game theory. Studying such a simple game also allows us to get 
tight bounds. However, variations of our results extend to other similar 2 per- 
son zero-sum repeated games. 

The payoffs at each round are shown in Figure [1] Let h : {H,T} x {H,T} — 
{ — 1, 1}, be the payoff to Player 1 (PI), and —h the payoff for P2. When we allow 
the players to randomize, we denote randomized strategy on A{H,T} 

for player i. Pi's expected payoff in one round is 

E[h{Si,S2)]= Pr(5i=si)Pr(52 = S2)Msi,S2). 

si,S26{H,T} 

Let u : {HjT}"" x {HjT}"" -> {— n, • • • , n} be Pi's cumulative payoff when the 
game is played for n rounds. In the repeated game, mixed strategies can be 
viewed as distribution over sequences of length n that are dependend on the 
opponent's strategy. Given the adversary's strategy, let Ri = (r^^,--- ,r") g 
A{H,T}" denote a randomized strategies for player i. Pi's expected cumulative 
payoff is 

n 

EKi?i,i?2)]=^E[Mr*i,r*)]. 
t=i 

Finally we define the expected average payoff to PI for the n- round game as 

n 

and consequently P2's expected payoff is — E [U{Ri, i?2)]- To denote player's i 
payoff we will sometimes use the standard notation E [U{Ri, R-i)], where Ri is 
player's i mixed strategy and R-i is his opponent's mixed strategy. 

Definition 2.1 (Nash equilibrium). A pair of mixed strategies (i?i,i?2) is a 
Nash equilibrium for the n-stage Matching Pennies game if for i — 1,2; 

E [C/(i?,, i?_,)] > E [U(RlR-^)] for all R[ e A{F, T}". 

In some cases we will consider a relaxed notion of equilibrium, namely e-Nash 
equilibrium. 

Definition 2.2 (e-Nash equilibrium). A pair of mixed strategies (i?i,i?2) is a 
e-Nash equilibrium for the n-stage Matching Pennies game if for i = I, 2: 

F,[U{Ri,R-i)] >'E[UiRi,R-i)] -e for all R'i € A{77,T}". 
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2.2 Complexity and Pseudorandomness 

We give a brief description of the pseudorandomness tools we need for this 
paper. For more details we recommend the textbooks of Arora and Barak [T] 
and Goldreich [5 . 

The model of computation used throughout most of the paper is based on 
Boolean Circuits. We consider circuits with AND, OR and NOT gates, and 
denote by C„ a circuit with n input nodes. A circuit family {CijigN is an 
infinite collection of circuits, intuitively one for each input length. 

The size of a circuit |C„| is the number of gates. A circuit family is poly- 
nomial sized if there is a fc > such that, for all n, |C„| < n*^. The class of 
languages recognizable by families of polynomial sized circuits is called P/Poly. 
Any language that can be decided in polynomial time by a deterministic or 
randomized Turing machine is also in P/Poly. Formally P C BPP C P/Poly. 

A function is one-way if it is easy to compute and hard to invert. 

Definition 2.3 (One-way Function). A one-way function is a polynomial-time 
computable function f : {0,1}" — >■ {0,1}" such that, for all polynomial size 
circuits D and y — f{x) (where x is chosen uniformly at random on {0,1}"^, 
Pr(/(Z?(y)) — f{x)) < n~'^ for all c > and sufficiently large n. 

Informally, two objects are indistinguishable if no polynomial sized circuit 
family can tell them apart with noticeable probability. 

Definition 2.4 (Indistinguishability) . Let X,Y be two random variables on 
{0, 1}". We say that X and Y are computationally indistinguishable if for every 
family of polynomial size circuits {Ci}jgpj, every c > and for sufficiently large 
n 

Pr(C„(X) = l)-Pr(C„(y)-l) 



1 
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A cryptographic pseudorandom number generator (PRNG) is a deterministic 
algorithm whose output is computationally indistinguishable from the uniform 
distribution, provided that it's input is truly random. We will denote by Uk a 
random variable uniformly distributed on {0, l}*"'. 

Definition 2.5 (Cryptographic PRNG). A cryptographic pseudorandom num- 
ber generator is a deterministic polynomial time algorithm G : {0,1}''-"-' 
{0,1}", where l{n) < n is a polynomial time computable function, such that 
G{Ui(^n)) o.i^d U„ are computationally indistinguishable. 

There are two basic properties about pseudorandom number generators that 
we will use. One relates the notion of pseudorandomness to the notion of pre- 
dictability. 

Definition 2.6 (Unpredictable). Let G : {0, 1}'^") ^ {0, 1}" be a polynomial 
time algorithm, and G{x) = {yi, ■ ■ ■ ,yn)- We call G unpredictable if for every 
family of polynomial size circuits {Di}^^^, all c > and for sufficiently large n 

Pr{D, (yi,...,2/,_i) =y,) < ^ -f ^. 
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Intuitively a pseudorandom number generator must be unpredictable, oth- 
erwise we could easily build a test for it by using the predictor circuits. In 
1982 Yao [H] proved the opposite implication, thus establishing the following 
theorem. 

Theorem 2.7 (Yao's Theorem). A polynomial time algorithm G : {0, — >• 
{0, 1}" is unpredictable if and only if G is a pseudorandom number generator. 

Hastad, Impagliazzo, Luby and Levin in 1999 [8 showed how to construct 
pseudorandom number generators with polynomial expansion based on one-way 
functions. 

Theorem 2.8 (PRNG's from one-way functions). One way functions exist if 
and only if for every S > there is a pseudorandom number generator with 
l{n) = n^. 

Cryptographic pseudorandom number generators' main power lies in the 
ability to fool any polynomial sized adversary, while running in polynomial time. 
However, in other areas of complexity, such as derandomization, the crucial issue 
is having a smaller seed. 

Definition 2.9 (Complexity PRNG). A complexity pseudorandom number gen- 
erator is a 2'("-' time computable function G : {0, 1}'(") — > {0, 1}", such that for 
any circuit C of size n 



Pr{G{Ui^,,)) = 1) - Pr(C(G ([/„)) = 1) 



< n 



The essential difference with cryptographic pseudorandom number genera- 
tors is the order of quantifiers. A cryptographic pseudorandom number genera- 
tor fools circuits of an arbitrary polynomial size. The complexity pseudorandom 
number generator fools circuits only of a fixed polynomial size but under the 
right assumptions requires far fewer random bits. 

Impagliazzo and Wigderson [10 building on a series of paper starting with 
Nisan and Wigderson [12 characterize when complexity pseudorandom number 
generators exist. 

Theorem 2.10. There exists L £ DTIME{2°Wn))^ and e > such that no 
circuit of size at most can compute L if and only if there exists a complexity 

pseudorandom number generator with l{n) = Clogn for some C > 0. 



3 Information Theoretic Bounds 

In this section we make no computational assumptions on the players, and show 
that there can be no Nash equilibrium if we limit the amount of randomness 
available to both players. 

Lemma 3.1. For any 7 e [0, 1], if P2 has less than n{l — 7) random bits, then 
PI has a deterministic strategy A that achieves an expected average payoff of at 
least 7. 
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Proof. Wc will give a strategy A = (a^, • ■ • ,a") for PI that achieves a high 
payoff against any strategy B = {b^, - ■ ■ , 6") from P2. 

Player 1 will enumerate all of P2's possible coin flips, and will obtain a set 
of 2"(-'^~'>'^ possible strategies, one of which is the one being used by P2. After 
each play by P2, PI can eliminate all the strategies that do not play that action 
at that round. Let 5* be the set of strategies that are consistent with P2's plays 
up to round t. Initially the set contains 2"'^^~^^ strategies, and, for all t, 

The strategy A for PI is straightforward: at round t, PI will play based 

on the most likely event: he will consider all strategies in S* and play H if the 
majority of strategies in 5* use H at round t and play T otherwise. Let be 
the exact fraction of strategies that are the majority at round t, 



max{|{6* I b' = H}\,\{b' \ =T}\} 



(3.1) 



so that p* G [1/2, 1]. Pi's expected payoff at round t is: 

E [h{a\ 6*)] = - (1 - p*) = 2p* - 1 > 0. 

Thus Pi's average expected payoff is at least 0. To show that PI can actually 
achieve an average expected payoff of 7 we need to consider the amount of 
information PI gains at each round. We define the following potential function 
(j): {l,...,n} ^R: 

t-i 

<P{t) = J2Ha^b'')-log\S^, 
fc=i 

which considers both the accumulated payoff for PI and the log-size of the set 
of consistent strategics. At time t = 1 there are 2"^^"'''^ possible strategics for 
P2, so 0(1) = — n(l — 7). We will now lower bound the expected increase in (j) 
at each round. We can express this as 



E 



.fe=l 



-E 



.fe=i 



-(E [log|5*+i|] -E[log|5*|]) 
= E [/!(«*,&*)] ^(E [log|^'+i|-log|5*l]) 
= 2p*-l-(E [log log 1^*0). 

Now consider E [log [S'*''"^!] . When PI looses he can eliminate a fraction of 
strategies, thus the new set 5*+^ will contain a (1 — p*) fraction of the strategies 
in 5*. On the other hand, when PI wins, [S'*"''"'^! — p*\S^\. To complete the 
analysis we have to consider two cases, since if p* = 1 then E [log [S'*"'"^!] is not 
well defined. 

First assume p* = 1. This happens when all feasible strategics for P2 have 
the same action at round t. In this case PI will win with probability 1, and the 
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size of the set of feasible strategies will stay the same. So, overall, the increase 
in (/) will be 1. 

Now assume p* £ [1/2, !)• Then, the expected size of the set 5**+^ is: 

E[log|5*+i|] = ((l-p*)log(l-p*)|5'|+p*logp*|5*|). (3.2) 

The expected change in log jS**] does not depend on S**, but only on p*, since 

E [log log 1^*1] - [(l-p*)log(l-p*)|5*| 

+p*logp*|5*|] -log|5*| 
= (1 -p*)log(l -p*) +p* logp*. 

So that the overall change in potential when < 1 is 

E [^{t + 1) - > 2p* - 1 - (1 - p*) log(l - p*) - \ogp\ 

which is always at least 1 for > 1/2. 

So, for all p* e [1/2, 1], at each step the potential function ip increases by at 
least 1. Thus, after n rounds we have that 

E [</>(n)] > </)(!) + mm{nE [(f>{t + 1) ~ 

> — n(l — "/) + n ~ nj. 

Since Pi's expected payoff is at least E [(j){n)], this completes the proof. □ 

This result immediately implies that, without any computational assump- 
tion, there can be no equilibrium with less than n random coins. 

Corollary 3.2. For all 71, 72 E [0, 1] such that 71+72 > 0, if PI and P2 have, 
respectively, n(l — 71) and n{l — 72) random coins, then there can be no Nash 
equilibrium in the n- stage Matching Pennies repeated game. 

Proof. Assume, by contradiction, that (S'i,iS'2) is such a Nash equilibrium. By 
Lemma [5711 Pi's expected payoff must be E [{/(S*!, S'2)] > 72 and P2's payoff 
— E \U{Si, S2)] > 71, otherwise they would be better off by using the majority 
strategy in the proof of Lemma 13.11 Summing the two inequalities we get 
71 + 72 < 0, a contradiction, since we assume 71 + 72 > 0. □ 

However, if we limit the amount of randomness available to both players, we 
are still able to achieve an e-Nash equilibrium. Furthermore, if the game has a 
e-Nash equilibrium, then both players must have at least (1 — e)n random coins. 

Theorem 3.3. Let 7 e [0, 1]. The game has a ^-Nash equilibrium if and only 
if both players have n(l — 7) random coins. 

For simplicity we assume 7 is the same for both players, however a similar 
result holds even in the case where the two players have a different amount of 
random coins. 
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Proof. To show the "only-if" impUcation, consider, by way of contradiction, a 
game that has a 7-Nash equihbrium {81,82) but in which both players have less 
than n(l — 7) random bits. Thus there must be a 7' > 7 such that they have 
exactly n(l — 7') random bits. Since {81,82) is a 7-Nash equilibrium, it must 
be the case that, for any strategy 8'^ for PI 

^[U{8i,82)]>^[U{8[,82)]-i. 

By Lemma [57T] we know that both players have a strategy that achieves a payoff 
of at least 7' > 7, so that the above implies E [{/(^i, S'2)] > 0. Applying the 
same argument to P2, we get — E [L''(S'i, S'2)] > 0, a contradiction. 

The "if" part follows from Lemma [3.41 below. □ 

Lemma 3.4. Let 7 G [0, 1]. // both players have n(l — 7) random coins, then 
the game has a ^-Nash equilibrium. 

For simplicity we assume 771 is even. 

Proof. Consider the following strategies: both player use their random coins 
to play uniformly at random for the first n(l — 7) rounds. Thereafter PI will 
always play H , while P2 will alternate between H and T, playing H,T, . . . . We 
claim that this is a 7-Nash equilibrium. 

First notice that no player can improve his payoff in the first n(f — 7) rounds, 
given his opponent's strategy. Let's consider the remaining 71,7 rounds. PI could 
improve his payoff by playing H,T, H,T . . . , however this only increases his 
payoff by 7. This holds also for P2, that could play T,T,. . . , however gaining 
only 7. □ 



4 Computationally Efficient Players 

The proof of Lemma 13.11 in the previous section relies heavily the fact that we 
make no computational assumptions. In particular, to implement the major- 
ity strategy and compute in (|3.ip requires solving #P hard problems, by 
reduction from #SAT. If we restrict the players to run in time polynomial in 
n this particular strategy likely becomes unfeasible. In this setting, under rea- 
sonable complexity assumptions, it is possible to greatly reduce the amount of 
randomness and, at the same time, achieve a e-Nash equilibrium. 

We consider players' whose actions are polynomial size Boolean circuits. A 
strategy is thus a circuit family {Ci}i^jq, such that circuit C;(„) takes as input 
l{n) random coins and outputs the n actions to be played. Notice that this 
definition implies that each agent can simulate any of his opponent's strategies. 

We consider equilibria that use random coins for any (5 > 0. Theorem 14. II 
shows that such e-Nash equilibria exist if and only if one-way functions exist. 

Theorem 4.1. If players are bound to run in time polynomial in n, then, for 
all S > and sufficiently large n, e-Nash equilibria that use only random 
coins exist, where e ~ n^^ for all k > and sufficiently large n 's, if and only if 
one-way functions exist. 
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Proof. The if part is Lemma 14.31 while the only- if part is Lemma 14.41 □ 



As a preliminary result we show that, in our setting, the expected utility 
when at least one player uses a pseudorandom number generator can't be too 
far from the expected utility when playing uniformly at random. 

Lemma 4.2. Assume one-way functions exist, and let G be the strategy corre- 
sponding to the output of a pseudorandom number generator. For any .strategy 
S that runs in time polynomial in n, for all k > and .sufficiently large n, 



E[[/(G,5)] 



< n 



and 



E[C/(5,G)] 



< n 



Proof. We prove only the first inequality, the proof for the second one being 
symmetric. 



Proof by contradiction. Assuming there is a fc > such that 



E [C/(G, S)] 



> 



n ^ for infinitely many n's, we will construct a test T for G, and show that 



Pr(T(G(;7,(„))) =1)-Pr (T ([/„)- 1) 



> n 



(4.1) 



for some c > and infinitely many n's, thus contradicting the assumption that 
G is a pseudorandom number generator. 

First consider the n random variables Ai{G^S), for i = I,-- - ,n, where 
A,(G, S) is simply Pi's payoff at round i. Since YJl=i MG, S) = nC/(G, S"), 



> n 



l-k 



i=l 



This implies that there must be an i such that E [Ai{G, S)] > n'^^ . Fix that i. 

The test T takes as input an n-bit sequence x and generates a sequence of 
plays s according to strategy S. Now T simulates an n-stage repeated Matching 
Pennies game with strategies (x, s). If PI wins the i-th round then it will output 
1, otherwise the output will be 0. In other words, T outputs 1 if and only if 
Ai{x^ s) — 1. Notice that T runs in time polynomial in n. 

When X is drawn from the uniform distribution, PI will win with probability 
1/2, orPr(T(f/„) = l) = l/2. 

Now notice, that since A, e {-1,1}, Pr {A, = 1) = Mi^l±i, 
that 

E [A,] + 1 1 1 
2 ^2^'^2' 

Thus 

Pr (T (Gi (C/,(„))) = 1) - Pr (T (C/„) = 1) 



This implies 



Pr(T(G(t/,(„))) = l) = 



> 



1 

2^' 



which proves the lemma. 



□ 



Lemma 4.3. // one-way functions exist and players are bound to run in time 
polynomial in n, then for every S,k > and for sufficiently large n, the n-stage 
has an n^^-Nash equilibrium in which each player uses at most random bits. 
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Proof. Assume, by contradiction, that one-way functions exist but there are 
values 5 > and fc > such that the game has no n~'^-Nash equihbrium in 
which players use at most random coins. 

Since we assume one-way functions exist, by Theorem 12.81 there exist pseu- 
dorandom number generators that use coins. Assume both players use the 
output of such pseudorandom number generators as their strategies (which we 
call, respectively, Gi and G2). Since we are assuming that this is not a 
Nash equilibrium, one of the players, say Player 1, must have a strategy A such 
that 

E [U (A, G2)] > E [[/ (Gi, G2)] + n-\ (4.2) 
By Lemma [4.21 we can choose a k' > such that 

E [U {A, G2)] > -n-''' + n-K (4.3) 

Pick c = fc' = fc + 1, so that E [U {A, G2)] > n^" for n > 2. This contradicts 
Lemma l4.2[ proving the claim. □ 

We now prove the opposite direction, that is that the existence of Nash 
equilibria that use few random bits implies the existence of one-way functions. 

Lemma 4.4. If for every S > there is a Nash equilibrium in which each player 
uses random bits and runs in time polynomial in n, then one-way functions 
exist. 

Proof. Let {A, B) be such a Nash equilibrium and assume, by contradiction, 
that one-way functions don't exist. This implies that pseudorandom number 
generators can't exist (Goldreich [5]), and so, A and B can't be sequences that 
are computationally indistinguishable from uniform. 

Thus, by Yao's theorem fTheorem l2.7p . we know that there are polynomial 
size circuit families {Gij^gj^j and {-Dijjgj^ such that 

Pr(G,(yi,...,2/,) > 1/2 + <5i 

Pr (A(2i,...,Zi) = z»+i) > 1/2 + (52, 

for some (52 > 0, where A{xi, . . . ,x;j(„)) = (yi, ?/2, • . . , J/n) and B{xi, . . . ,xi.^{n)) = 

(Zl, Z2, . . . , Zn). 

To get a contradiction it is sufficient to show that players are better off by 
using the predictor circuits C and D. Consider Player 1: using Z?, at each 
round he can guess, given the previous history, the opponent's next move with 
probability ^-\- 5i. Thus his expected payoff at any round t is 

E[/i(d*,&*)] > i + ^i-i + Ji = 2,5i, 

where the expectation is over the internal coin tosses of the predictor circuit D. 
The overall expected payoff is 

1 " 

E [[/(£>, B)] > - ^25 = 25i. 
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Now, let w = E [L'^(A, B)] be the value of the expected payoff when players play 
{A,B). Consider the following cases: 

i) w < 0: this implies that Player 1 could gain 26i by using strategy D, 

ii) w > 0: by definition Player 2's expected payoff is — E [U{A, B)] < 0, so 
Player 2 can achieve a higher payoff by using his predictor circuit C, 

In both cases we see that {A, B) can't be a Nash equilibrium, a contradiction. 

□ 

5 Exchanging Time for Randomness 

In this section we determine conditions under which a e-Nash equilibrium can 
arise, given that one of the players has only a logarithmic amount of randomness 
and his opponent must run in time for some fixed k. This shows how we 
can trade off randomness for time; the player with 0(log7i) random bits runs 
in time polynomial in n, while the player with more random bits runs in fixed 
polynomial time. 

Theorem 5.1. Assume there exists f G DT/ME(2°('("))) and e > such that 
no circuit of size at most 2^'^") can compute f and that one-way functions 
exist. Let Player 1 's strategies be circuits of size at most n*' that use at most 
random bits for some fc > 2 + c6, where c > \ is a constant related to the 
implementation of a cryptographic pseudorandom number generator. Assume 
Player 2 has access to only A/logrt random bits. As long as M > Ck, where C 
is the constant in Theorem \2.10\. then for all e > and sufficiently large n there 
is a e-Nash equilibrium. 

Proof. Let Gi be the cryptographic pseudorandom number generator available 
to Player 1 and G2 be the complexity pseudorandom number generator used by 
player 2. Furthermore let Si be the set of all possible strategies for PI (for all 
(5 > circuits of size at most n'^ that use random bits), and S2 the set of 
strategies available to P2 (polynomial size circuit families and M\ogn random 
coins). We will show that for all e > and sufficiently large n, (Gi,G2) is a 
e-Nash equilibrium with the required properties. 

First we argue that, for all 7 > and sufficiently large n, |E [J7(Gi, G2)] | < 
7. The proof of this fact is similar to the proof of Lemma 14.21 showing by way 
of contradiction, that if |E [[/(Gi,G2)] | > 7 then we can build a test for the 
cryptographic pseudorandom number generator Gi . 

Now we show that, for the appropriate setting of parameters, Gi fools 52 and 
G2 fools Si. For any /c. Player 2 can fool circuits of size by using Glogn'^ = 
Ck log n random bits. So, for M > Gfc, G2 fools Si. Notice also that since Player 
2 runs in time 0{n^''), the cryptographic pseudorandom number generator Gi 
fools S2 . Let h be the one-way permutation used by the pseudorandom number 
generator Gi, and assume h us computable in time n'^ for some c > 0. Given h, 
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Gi is defined as follows: let x,y G {0,1} "2 , and let {x,y) be Gi's seed (notice 
that K^,?/)! = n^), then 

Giix, y) = (r(x) y, r-\x) y, . . . , f{x) y) , 

where x G y = Xiyi mod 2. There are 0{n^) applications of /i, so Gi runs 
in time 0(n2+='5). So, for fc > 2 + c(5, Gi fools 52- 

At this point we're almost done. As in Lemma 14.31 assume, by contradiction, 
that the assumptions in the theorem hold but (Gi,G2) is not a e-Nash equi- 
librium for some e > 0. This implies that at least one of the two players can 
improve his expected payoff by more than e by switching to some other strategy. 
First consider P2, and assume there is a strategy S2 G ^2 such that 

E[t/(Gi,52)] >E[C/(Gi,G2)]+e. 

As in Lemma 14.31 this implies that 5*2 would be a test for the cryptographic 
pseudorandom number generator Gi, contradicting the fact that Gi fools ^2. 
Similarly, assume PI has a strategy 5*1 G iSi such that 

E[t/(^i,G2)] >E[C/(Gi,G2)]+e. 

Again, Si can easily be made into a test for G2, contradicting the fact that G2 
fools iSi. □ 



6 Infinite Play 

We now consider an infinitely repeated game of Matching Pennies, and show 
that, if utilities are time discounted, we can always achieve a e-Nash equilibria 
using a large enough (but finite) amount of random coins. First we determine 
the least amount of randomness required to achieve a e-Nash equilibrium in the 
general, i.e. computationally unbounded, case. 

Lemma 6.1. For all discount factors 5 S (0, 1) and all e > 0, there is an e-Nash 
equilibrium in which the players use only n random bits, for n > ^^^^i~T^- 

Proof. Given 5,£ > consider the following strategies: both players play the 
Nash equilibrium strategy for the first n rounds. After this PI will always play 
H, while P2 will play H and T alternatively. The overall expected payoff is 
0. However, after round n, both players could switch to a strategy that always 
wins, achieving a total expected payoff of + X^t^n = T^- To ensure that 
our strategies are indeed a e-Nash equilibrium we just need to make sure that 

(5" 

Rearranging and taking logarithms we get n > ^^^^j^jg-^- CH 
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Now we consider players' whose strategies are families of polynomial size 
Boolean circuits (as in Section |4]), and assume one-way functions exist. We first 
give a version of Lemma 14.21 for time discounted utilities on a finite number of 
rounds. 

Lemma 6.2. Assume one-way functions exist, and let G = (g^, . . . ,5") be the 
strategy corresponding to the output of a cryptographic pseudorandom number 
generator. Let S — (s^, . . . , s") be any strategy. For all 6 G (0, 1), k > and 
for sufficiently large n 



E 



J2S'h{g',s') 



< n 



id E 



S'his\g') 



< n 



for some 5, k and infinitely many n's. 
. . . , A„(c", C2 ), defined as At{c\,c\) = 



Proof. Again we give the proof only for the first inequality. Assume, by contra- 
diction, that |E E'tLi 5^h{g\s*)] \ > n' 
Consider the random variables j4i(c}, C2) 
1 if PI wins round t when playing according to (Ci,C2) and otherwise. Let 
^(Ci, C2) = Et S'Aticlci), so that E [A{G, S)] > |E ELi S'h{g\s')] \ > n-K 
This implies that there is a t such that 5*E [At{g*, s*)] > n^^~^ , which implies 
E [yl((5*,s*)] > n~^~^. Fix that t. As in Lemma l4?2| consider the test T that. 



given a sequence of plays x, generates a play s from S and outputs 1 if PI wins 
round t and outputs otherwise. 

When X is drawn uniformly at random, Pr (r(C/„) = 1) = 1/2. On the other 
hand, when x is G"s output. 



Pr(r(G(f/,(„))) = 1) =E[At] > 



Now 



for c > 



Pr (r(G(t/,(„))) = 1) - Pr (r(C/„) = 1) 

log(l/2-n-^-'') 



1 

>2 



j^^^ > 0, contradicting the assumption that G is a pseudo- 
random number generator. □ 

Using the above Lemma we can show that, for all discount factors, we can 
greatly reduce the amount of random coins needed to get an e-Nash equilibrium. 

Lemma 6.3. For all discount factors S G (0,1), all s > and all ^ > 0, 
there is a n~'^-Nash equilibrium in which players use only random coins, for 
sufficiently large n 's. 

Proof. As in the proof of Lemma [01 we consider the following strategy for both 
players: for the first n rounds play the output of a cryptographic pseudorandom 
number generator G, with seed length n^. Thereafter PI will always play H, 
while P2 will alternate between H and T. Pick any A; > 0, we now show that 
this is a n~'^-Nash equilibrium. By Lemma [6.21 we can pick k' — k/2 such that 

-'=',n-'='l. To 



the expected utility in the first n rounds lies in the interval [—n 
ensure that this is a n 



''-Nash equilibrium we just need to show that 
5" 



-n 



{l~5) 



n 
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or 



(1-5)- 

Now, for any c > 0, if we set k' = '""^''^logl ^"'^^'^ i then the left hand side of the 
above inequahty is c, so that it always holds for sufficiently large n's. □ 

Thus, given any n that satisfies the conditions in Lemma |6.1[ there can be 
a n^'^'-Nash equilibrium using coins, for any 5 > 0. To see this, consider an 
m sufficiently large so that Lemma l673l holds and pick ^ = 5 1°^^^ ■ 



7 Conclusions 

We have shown how, in a simple setting, reducing the amount of randomness 
available to players affects Nash equilibria. In particular, if wc make no com- 
putational assumptions on the players, there is a direct tradeoff between the 
amount of randomness and the approximation to a Nash equilibrium we can 
achieve. If, instead, players are bound to run in polynomial time, we can get 
very close to a Nash equilibrium with only random coins, for any 6 > 0. 
Some directions for future research include: 

• Is it possible to extend Lemma [3Tl to m player games, for m > 2? Notice 
that the strategy used in that proof does not generalize to this setting. 

• Under what circumstances is it possible to further reduce the amount of 
randomness available (say to O(logn) for both players)? 

• Is it possible to extend these results to general zero-sum games or even 
non zero-sum games? 



Acknowledgments We wish to thank Tai-Wei Hu, Peter Bro Miltersen, Rahul 
Santhanam and Rakesh Vohra for fruitful discussions. 



References 

[1] S. Arora and B. Barak. Computational Complexity: A Modern Approach. 
Cambridge University Press, 2009. 

[2] X. Chen and X. Deng. Settling the Complexity of Two-player Nash Equi- 
librium. In 47th FOCS, 2006. 

[3] C. Daskalakis, P. W. Goldberg, and C. H. Papadimitriou. The complexity 
of computing a nash equilibrium. SIAM J. Comput., 39(l):195-259, 2009. 

[4] L. Fortnow and R. Santhanam. Bounding Rationality by Discounting Time. 
In 1st ICS, 2010. 



15 



[5] O. Goldreich. Foundations of Cryptography: Basic Applications. Cam- 
bridge University Press, 2004. 

[6] O. Gossner and T. Tomala. Entropy Bounds on Bayesian Learning. Journal 
of Mathematical Economics, 44(l):24-32, 2008. 

[7] J. Y. Halpern and R. Pass. Game theory with costly computation: For- 
mulation and application to protocol security. In 1st ICS, pages 120-142, 
2010. 

[8] J. Hastad, R. Impagliazzo, L. Levin, and M. Luby. A Pseudorandom 
Generator from any One-way Function. SI AM Journal on Computing, 
28(4):1364-1396, 1999. 

[9] T.-W. Hu. Complexity and Mixed Strategy Equilibria. 

|http : //bit ■ ly/e4N8cN| 2010. Working Paper. 

[10] R. Impagliazzo and A. Wigderson. Randomness vs time: Derandomization 
under a uniform assumption. J. Comput. Syst. Sci., 63(4):672-688, 2001. 

[11] S. Kalyanaraman and C. Umans. Algorithms for Playing Games with Lim- 
ited Randomness. In 15th ESA, 2007. 

[12] N. Nisan and A. Wigderson. Hardness vs Randomness. J. Comput. Syst. 
Sci., 49(2):149-167, 1994. 

[13] H. Simon. A Behavioral Model of Rational Choice. The Quarterly Journal 
of Economics, 69(1):99-118, 1955. 

[14] A. Yao. Theory and Application of Trapdoor Functions. In 23rd FOCS, 
1982. 



16 



